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1. INTRODUCTION 

The data mining techniques are widely used in e-government, especially in the criminology field, 
wherein they help police departments predicate criminals and detect information about crime locations. 
The study of crime is expected not only to control present crime but also to analyze the criminal activities 
so that future occurrences of the same incidents can be overcome. Government is trying to improve 
the effectiveness of prevention of crimes that can be happened in clustering criminal features depending 
on spatial and temporal criteria which are different in each country [1-11]. It also tries to discover 
the correlation between different attributes, such as crime and demographics. Analyzing historical data is useful 
to discover the crime pattern [12]. Past researchers in crime detection depended on data mining and machine 
learning to develop systems that analyze patterns to have the highest probability of occurring and predict 
the regions where such patterns are likely to happen. Some researchers utilized the algorithm of Baum-Welch 
in past studies, which collected GPS data for predicting criminal movements. The work of [13] used attributes 
according to their characteristics and trained an HMM for each cluster, obtaining an accuracy result of 13.85%. 
The authors suggested data mining methods for crime detection and identification of criminal for Indian 
country during the period of 2000 to 2012. They depended on clustering, used k-means, are based on crime 
attributes, and made visualizations using Google Maps. They also used ANNs and forest decision trees (DTs) 
for classification. The outcomes were computed by the Weka tool. The accuracy of the ANN was 90.02. They 
applied 10-fold cross validation for the forest DTs, the accuracy in RMSE was 0.0751, and the time taken to 
build the model was 4.24 s [14]. In [15], the proposed approach helped agencies to emphasize the security 
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in Indian, their work needs to improve with more data mining method. Other researchers considered the use of 
the Viterbi algorithm for identifying locations with a high probability of having a criminal depending 
on the relationship between the location and types of crimes, such as murder, thief, and assault, they handle 
idling time rather than accuracy problem. The Latent hidden Markov models are types of algorithms that have 
been designed to detect crime activities by obtaining a sequence of observations from hidden values. 
They developed an algorithm that fits regular vine copula to generate tree structures. These trees had been 
employed to produce an emission matrix for hidden Markov model (HMM) algorithm, the fusion of coupled 
parameters with two types of HMM algorithms, this work needs to improve accuracy and reduce execution 
time. [1]. In this work, we attempt to overcome the problems in previous studies and develop the work of past 
researcher [1, 3, 14], by proposing a hybridization between Baum Welch algorithm and Viterbi algorithms 
in a side, and Viterbi algorithms for optimization of data set then applied DT, at the other side. The objective 
of proposed work to handles an important topic of monitoring criminal activities and movements and indicates 
the level of danger in locations, the preposd hybridization method reduces the consumed time and increases 
accuracy, respectively. 


2. HMM 

The HMM is a robust model which can be used when states in a process are not observable, but 
observed data depends on these hidden states. HMM depends on two main properties, which are: 
— The observation at time ¢ is produced by a process whose state H, is hidden from the observer. 
— The state of the hidden process represents the Markov chain [16-23]. 

At the other hand, the Viterbi is a dynamic programming algorithm that depends on transition and 
emission matrices. In particular, the Viterbi algorithm can obtain path (state sequences) to generate output 
sequences. It works by finding the maximum overall possible state sequence by considering a forwarding 
algorithm. The Baum (1970) proposed the use of the Baum—Welch algorithm based on the computation of 
the probabilistic methods of the Markov model. It also called the forward-backward method, wherein 
the backward part represented the probability of partial observation sequences from the time (¢+1) to end, can 
be computed iteratively [24]. 


3. OVERVIEW OF ADOPTED DATASETS 

In this work, two datasets have been used in the criminal field. The first one is the Iraqi dataset, while 
the second dataset is India dataset [25]. The Iraq dataset consists of features such as as{age, gender, ID, crime 
types, locations, gang, longitude, latitude} the type of features is categorization whereas the features of Indian 
data set is included {states, murder, attempted to murder ,..., thief}, making a partitioning based on clustering 
needs more investigation and analysis of the contexts in a discrete sequence dataset. Here, the needed to predict 
the locations of the crimes, and the HMM to obtain the hidden pattern is required. As mentioned earlier, 
the dataset is collected from a website and social media, where Baghdad city can be divided into different main 
locations as shown in proposed Table 1. 

While the representation of the second dataset is shown in propsed Table 2. The proposed system is 
also applied to India dataset. As mentioned above, the HMM must generate ten locations: Locl, Loc2, ...., 
Loc32, where these locations represent the workflow of the proposed system in the clustering. Here, the needed 
to predict the locations of the crimes, and the HMM to obtain the hidden pattern in this work as well as the two 
hidden states with one observe the state. 














Table 1. Location of Iraq representation Table 2. Location of Indian states representation 
Seq Location Representation state Representation 
1 Adamya Location 1(Loc_1) 1 ANDHRA PRADESH Location 1(Loc_1) 
2 Albayaa Location 2(Loc_2) 2 ARUNACHAL PRADESH Location 2(Loc_2) 
3 Hayalamel Location 3(Loc_3) 3 ASSAM Location 3(Loc_3) 
4 kadmiya Location 4(Loc_4) 4 BIHAR Location 4(Loc_4) 
5 Madaan Location 5(Loc_5) 5 CHHATTISGARH Location 5(Loc_5) 
6 Mansour Location 6(Loc_6) 
7 Baghdadgedeeda Location 7(Loc_7) 
8 Hasania Location 8(Loc_8) : ; 
9 Madeena Location 9(Loc_9) 32 PUDUCHERRY Location 
10 Abogreeb Location 10(Loc_10) 32(Loc_32) 
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4. PROPOSED METHOD 

The Viterbi algorithm as an optimizer algorithm in two strategies is proposed. The first one is built 
with Baum Welch, while the second one has been created in combination with the DT algorithm. Viterbi is 
a supervised clustering algorithm that makes partitioning-based clustering for learning contexts in a discreet 
sequence dataset. And it works as an outlier detector that removes low-probability data (abnormal data) that 
explains in detail the following section. 


4.1. Hybridization Viterbi algorithm with Baum Welch 

We explain the elements of the proposed workflow that include the hybridization of the Viterbi 
algorithm with Baum Welch to appear in three stages as shown in Figure 1. The Viterbi plays a major role 
in enhancing the clustering model; in addition, the dataset formulates the structure of the dynamic model 
using vine while the movement of criminals is determined by the generation of dynamic transition matrix 
from dataset. 





Generate Emission Matrix 
— 


Initial 
Sequence 
of locations 
















Compute 
Transition 
Matrix 









iterbi 
alg 
















Most 
probable 
crimes 





Estimation 
of crimes 
locations 


Figure 1. Hybridization Viterbi algorithm with Baum Welch 


4.1.1. Emission matrix generation stage 

The structure of matrix is generated using a simple vine couple to produce a simple tree structure 
strategy for selecting a tree model based on vine algorithm and generating a compact model. Conditional 
probability (Bayes theorem) is applied to the tree structure and used for assuming that the couple parameter 
{month, location} is combined with crime types to produce three dimensions rather than two dimensions. 
The Emission matrix can be embedded into HMM (Baum-Welch and Viterbi algorithms) and evaluated 
using in (1), 


P(loc1/ murder,Sep) =- P(locN/ murder, Sep) 
Emi = ; “e : (1) 
P(loc1/ Others,Oct) ++ P(locN/ Others, Oct) 


4.1.2. Transition matrix generation stage 

The transition matrix is used to represent the Markov chain, and it is defined as a set of (location) 
states (S = {sl_, s2..., sn}), where Locations are represented by criminal movements in a different location. 
The transition is represented as the square array Tnxn, where N= 10, for Iraq dataset, and N=32 for India dataset. 
The transition matrix can be construction as the following algorithm, where the summation of the probability 
of each row in the matrix must be an equal one. 
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An algorithm of Transition Matrix 

Function Transition (x), where x is location of crime 
Returns Probability matrix of criminal’s movements 

m = maximum value (x); 

y = zeros(m,1) \\ vertice 

construct probability matrix p mm 

for k=1to n-1 





y(x(k)) = y(x(k)) + 1; 
p(x(k), x(k+1)) = p(x(k), x(k+1)) + 1; where k is the number of movements 
end loop 
if y ==0 then 
p=0 
else 
p = (p div y); 
end if 





4.1.3. Sequence generation stage 
The sequence is the third parameter of HMM, in this step is frist contribution which defines 
the sequence of crime that happens, to obtain the most probable location of (crimes)states. In traditional and 
proposal work, the switch has been made between states and sequences (crimes) in two important phases: 
— Replacing the sequence value with a state value, (i.e. the sequence as coupled parameters {crime types} 
is replaced with {locations}. 
— Inserting states (locations) as a sequence into the Viterbi algorithm to generate the most probable sequence 
of crimes as an output. 


4.2. DT algorithms with Viterbi algorithms 
Here, the second strategy of improving decision tree algorithm is explained. The adopted steps of 

the proposed algorithm are illustrated in Figure 2 following the steps of: 

Step 1: Start. 

Step 2: Generate initialize the value of the EIMS matrix as mention in section 2 that computes the probability 
between {crimes type, month} and {locations}. 

Step 3: Generate initialize the value of the TRNS matrix, in the same way, that mention in section, to 
determine criminal’s movements. 

Step 4: A sequence of couple parameter initialization. 
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Figure 2. DT algorithms with Viterbi algorithms 
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Step 5: apply the Viterbi algorithm which can be used as an optimization algorithm by the elimination of 
the location with low probability as an outlier to maximize to increase the efficiency of time execution, 

Step 6: Remove locations with low probability from the dataset. 

Step 7: Generate new dataset, with improved quality of the dataset. 

Step 8: Apply Decision Tree algorithm with hyperparameter, obtaining Nid as a label for first dataset, and 
kidnapping as a label for the second dataset. 

Step 9: End 


5. CONTRIBUTIONS 

The Vine Copula Baum-Welch algorithm can achieve good results with the hybrid Vine Copula 
Viterbi algorithm for determining the sequential relation of past crime types, date, and locations. The speed 
of execution and accuracy of the Baum-Welch algorithm are enhancedNid of Iraq can help to facilitate 
the identification of criminals by the police and checkpoints. The contribution of this work, including a dataset, 
is optimized using a Viterbi algorithm for outlier detection and generates a new dataset.which is used to 
generate DT. 


6. RESULT AND IMPLEMENTATION 

The implementation of the proposed algorithm in both strategies can be explained in numerous stages: 
Stage 1: Generate the Emission matrix, Each value of the probability matrix related to crime type in a specific 
location. The main locations can then be determined from the column of crime type after transposition, 
as mention Emission matrix initially It was built by the implementation of the theory of Bayes, then was trained 
using latent Markov method ,the Implementation of generated matrix of the first dataset is shown in Table 3. 
While the Implementation of generated Emission matrix of the second dataset is shown in Table 4. The two 
features selected {thief, other IPC crimes}, the porobabilty crime occring appering in (32) states in indain 
country, as mention Emission matrix initially It was built by Bayes thory, then was trained using latent 
Markov method. Finally, The Emission matrix had been constructed with three parameters which had the main 
role HMM algorithms. 


Table 3. The implementation emission matrix of the frist dataset 











Locl Loc2 Loc3 Locl0 
Theft 0.1250 0 0 mi 0 
Murder 0.3333 0 0 e 0 
Others 0.5000 0 0 sie 0 
Theft 0.4286 0.2857 0 ae 0 
Murder 0.0213 0.4894 0.1277 wi 0.2340 
Others 0 1.0000 0 s 0 





Table 4. Emission matrix of the second dataset 








locl loc2 lOSA — miin loc32 
Thief 0.0538 0.0010 OOII aaka 0.0076 
Other IPC Crimes 0.0019 0 O02590 E vaes 0.0019 





Stage 2: Generating a transition matrix for both datasets. The locations representation as a state diagram of 
Baghdad city is shown in Figure 3. It is important to note that the transition stages express the movement of 
criminals among location. The locations are represented in digraph of the Markov chain. Where the circle 
shape represents the nodes (cities) in which the crimes occurred and the edges connecting the nodes between 
them, to represent the possibility of transition between nodes. 
Stage 3: Entering the sequence of crimes for both datasets. 
Stage 4: Table 5 to Table 6 illustrates the comparison of the proposed algorithm with the traditional methods 
in terms of accuracy, relative mean square error (RMSE) and consumed time. The RMSE is considered 
for accuracy expression for HMM methods that includes Baum Welch and the proposed Baum Welch+Vetribi 
as shown in Table 5. It is well shown that the proposed algorithm for both datasets decreases the RMSE, 
clearly by (0.13). 

At the other hand, Table 6 explains the comparison between the proposed (DT with Viterbi) and 
traditional DT. In this table, the accuracy of a confusion matrix is considered as the main indicator. The DT 
based on Matlab library for implementation.The accuracy of DT before the optimization is approximately 
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(98%) for the first dataset, whereas it is nearly (93.8%) for the second dataset. The accuracy of DT after 
optimization is improved by approximately (98.8% ) for the first dataset. For the second dataset, the accuracy 
is also improved by around (95.9%) after optimization. The obtained results show that the proposed DT+ Viterbi 
increases the accuracy for both datasets. 





> ayalamel 





Figure 3. State transition before optimization of Iraq dataset 


Table 5. Comparison of results for RMSE 
Baum Welch [2] Baum Welch +Viterbi (preposed work) 
Dataset 0.17 0.12 
Dataset2 0.04 0.03 











Table 6. Comparison of results for accuracy in the confusion matrix 
Accuracy in the confusion matrix 
DT [3] Prepoded DT+ viterbi (preposed work) 
Dataset] ~%98 ~%98.8 
Dataset2 ~%93.8 ~%95.9 











Table 7. Shows the time-consuming comparison of the proposed methods (Baum Welch + Viterbi) 
with the traditional (Baum Welch) algorithms. Applied in two datasets, both improved to 0.05 and 0.08 
respectively. It is proved that the proposed methods including Baun Welch outperform the traditional 
approaches which had been improved by (~0.15 sec) while Deepika K.K and at [14], achieves 4.24 s with 
0.0751 as RMSE. 


Table 7. Comparison of results for the consumed time 
Consumed Time in (second) 
Baum Welch [1] Baum Welch +Viterbi (preposed work) 
Dataset1 0.2 0.05 
Dataset2 0.1 0.08 











7. CONCLUSION 

In this paper, a proposed HMM has produced to predicate the level of danger in a specific region. 
The combination was done using Baum Welch with Viterbi from a side and DT with Viterbi at the other side 
to get a more accurate model. The proposed methods can achieve promising results for considering the Viterbi 
algorithm to determine the sequential relation of past crime types, with two latent parameters {month and 
locations}. The optimization of using DT method helped the proposed algorithm in mining the optimal solution. 
In addition, the consumed time is reduced in efficient performance. 
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